DATA SECURITY SYSTEM AND METHOD 
[0001] This patent application is based upon and claims priority of provisional patent 

application No. 60/260,398, filed June 5, 2001; application No. 60/287,813, filed on May, 2, 2001; 
application No. 60/267,944, filed February 12, 2001; application No. 60/247,242, filed November 
13, 2000 and application No. 60/247,232, filed November 13, 2000. 

[0002] The present mvention relates to a data security system and method and, more 

specifically, the filtering, extraction and conti-oUed release of data utiUzing security protocols. 

Background of the Invention 
[0003] The extensive use of computers and the continued expansion of telecommunications 

networks, particularly the Internet, enable businesses, governments and individuals to create 
documents (whether text, images or a combination thereof, sometimes identified by others as "data 
obj ects") and distribute those documents widely to others. Although the production, distribution and 
publication of documents is generally beneficial to society, there is a need to limit the distribution 
and publication of security sensitive words, characters or icons. Concerns regarding the privacy of 
certain data (for example, an individual's social security number, credit history, medical history, 
busmess trade secrets and financial data) is an important issue in society. In another words, 
individuals and businesses have a greater concern regarding maintaining the secrecy of certain 
information in view of the mcreasing ease of distribution of documents through computer networks 
and the Internet. 

[0004] U.S. Patent No. 6,055,544 to DeRose et al. discloses the generation of chunks of a 

long document for an electronic book system. DeRose '544 discloses solutions available to book 
publishers to pubhsh books in electronic format on the worldwide web. One of the problems is that 
the books are published as small document fragments ratiier than pubUshing an entire book which, 
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due to the formatting, protocol and command structure on the Internet, downloads an entire book to 
the user. The problem involved with publishing small documents is that there is no relationship to 
other portions of the book. See col. 3, lines 5 1-55 and col. 4, lines 3-5. One methodology to solve 
the problem involves inserting hypertext links in the book. This places a large burden on the book 
publisher. Col 4, lines 19-21 . Accordingly, it is an object of DeRose '544 to provide a mechanism 
for accessing only a portion of a large, electronically published document and automatically 
determining what portion of the document to download to the user based upon user selections that 
is, previous portions and subsequent portions of the document are downloaded with the selected 
portion, without maintaining separate data files for each portion of the document. Col 4, lines 34- 
39. In other words, if a person wanted to access chapter 4 of a text, the system in DeRose '544 
would display chapter 4, chapter 3 (the preceding chapter) and chapter 5 (the subsequent chapter). 
This publishing of portions of the document utilizes a subset of marked up elements established as 
being significant and a second subset of elements being less significant. For example, "Title 
elements" define a table of contents. A first representation of the document structure defined by all 
of the marked up elements may be used in combination with a second representation of the document 
structure defined only by the significant elements to control selection of portions of the documents 
such that previous and subsequent portions may be selected and rendered in a consistent and intuitive 
manner " Col. 4, lines 38-55. A computer system stores a first representation of the hierarchy of 
all elements in the electronic document. As example, this may be each chapter in its entirety. The 
computer also stores a second representation of the hierarchy of only significant elements in the 
electronic document. As an example, this may be a listing of each chapter without the text 
associated with the chapter. In response to request for a portion of the doctiment, the computer 
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system selects the portion defined by the significant element in the second representation. For 
example, if the user requested chapter 4, the entirety of chapter 4 would be downloaded from the 
web server to the client computer. In addition to rendering or publishing the selected chapter, the 
computer system looks to the relationship of the elements in the first representation of the hierarchy 
(the list of all chapters) and downloads fi"om the web server the adjacent chapters. In this example, 
this would involve downloading chapters 3 and chapter 5. In a further embodiment, the computer 
system selects only a leaf element of the second representation as a significant element during the 
dovmload. See the Summary of the Invention, col. 4, line 40 through col 6, line 14. 
[0005] U.S. PatentNo. 5,832,212 to Cragunetal. discloses a censoring browser method for 

viewing downloaded and downloading Internet documents. The abstract describes the system as 
including a user profile including user selected censoring parameters. Data packet contents are 
received from the Internet and the packets are compared with the user selected censoring parameters. 
Responsive to the comparison, the received data packet contents are processed and selectively 
displayed. The user selected censoring parameters include censored words and word fragments, and 
user selected categories. Compared word and word fragments can be removed and selectively 
replaced with predefined characters or acceptable substitute words. Tallies of weights for user 
selected categories are accumulated and compared with used selected threshold values, A predefined 
message can be displayed responsive to an accumulated tally exceeding a user selected threshold 
value without displaying the received data packet contents. 
[0006] U.S. Patent No. 6,094,483 to Fridrich discloses an encryption methodology hiding 

data and messages in images. In one application of the system in Fridrich '483, a method is 
disclosed of embedding a secret digital square image with 256 gray levels within an image carrier. 
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The secret image is first encrypted using a chaotic Baker map. The resulting image is a random 
collection of pixels with randomly distributed gray levels without any spatial correlations. The 
carrier image which is twice the size (height and width or 2nX2m) the secret image with 256 gray 
levels. The carrier image is modified according to a mathematical formula. 
[0007] U.S. Patent No. 5,485,474 to Rabin discloses a scheme for information dispersal and 

reconstruction. Information to be transmitted or stored is represented as N elements of a field or a 
computational structure. These N characters of information are grouped into a set of n pieces, each 
containing m characters, col. 1, lines 37-46. The system is used for fault tolerance storage in a 
partitioned or distributed memory system. Information is disbursed into n pieces so that any m 
pieces suffice for reconstruction. The pieces are stored in different parts of the memory storage 
medium, A fairly complex mathematical algorithm is utilized to provide reconstruction of the 
information utilizing no fewer than m pieces, 

[0008] U.S. Patent No. 6,192,472 B 1 to Garay et al. discloses a method and apparatus for the 

secure distributed storage and retrieval of information. Garay '472 identifies the problem as how 
to store information in view of random hardware or telecommunications failures. CoL 1, lines 17- 
20. The initial solution is to replicate the stored data in multiple locations. CoL 1, lines 28-31. 
Another solution is to disburse the information utilizing in Information Disbursal Algorithm (IDA). 
The basic approach taking in IDA is to distribute the information F being stored among n active 
processors in such a way that the retrieval of F is possible even in the presence of up to t failed 
(inactive) processors. CoL 1, lines 40-44. Another issue is the utilization of cryptographic tools. 
With the use of tools called distributed fingerprints (hashes), the stored data is distributed using the 
fingerprints and coding fimctions to determine errors. In this way, the correct processors are able 
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to reconstruct the fingerprint using the code's decoding function, check whether the pieces of the file 
F were correctly returned, and finally reconstruct F fi*om the correct pieces using the IDA algorithm. 
Col. 2, lines 50-59. Garay '472 also discloses the use of Secure Storage and Retrieval of Information 
(SSRI) with the added requirement of confidentiality of information. Col. 3, line 56. With this 
added requirement, any collision of up to t processors (except ones including the rightfiil owner of 
the information) should not be able to leam anything about the information. Confidentiality of 
information is easily achieved by encryption. Col. 3, lines 56-6 1 , The issue involves encryption key 
management, that is, the safe deposit of cryptographic keys. Garay '472 discloses confidentiality 
protocol utilizing distributed key management features. This mechanism allows the user to keep his 
or her decryption key shared among several n servers in such a way that when the user wants to 
decrypt a given encrypted text, the user would have to interact with a single server (the gateway) to 
obtain the matching plaintext while none of the servers (including the gateway) gets any information 
about the plaintext. Col 4, lines 5-14. 

[0009] U.S. Patent No. 5,996,01 1 to Humes discloses a system and a method for filtering 

data received over the Internet by a client computer. The system restricts access to objectionable or 
target data received by a client computer over an Internet by a web server by filtering objectionable 
data from the data received. The Humes '01 1 system filters the data "on the fly." Further, the 
Humes 'Oil system can be applied to process any type of target data from the data received and 
displayed to the user. Col. 2, lines 32-44. If the web page requested by the user contains only a 
minimum amount of objectionable or target data, the user receives only a portion of the filtered web 
page for viewing. Hume 'Oil also provides that if the web page contains a large amount of 
objectionable material, the system blocks the entire display of the web page on the user's computer 
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monitor. Col. 2, lines 56-62. Hume '01 1 provides three levels of filtering. At the first level, if the 
domain name contains objectionable words or material, the initial download from the domain is 
blocked. At the second level, the text in the download is filtered and objectionable words are 
replaced with a predetermined icon, for example, — CoL 3, lines 32-35. The filter uses a 
dictionary. Col. 3, lines 45-48. The filtered out words are counted. If the final score of "filtered 
out" material exceeds a predetermined threshold, the entire page is blocked from the user's view. 
CoL 4, lines 2-4. 

[0010] U.S. Patent No. 5,905,980 to Masuichi, et al., discloses a document processing 

apparatus for processing various types of documents, a word extracting apparatus for extracting a 
word from a text item including plural words, a word extracting method used in the document 
processing apparatus, and a storage medium for storing a word extracting program. Extracted words 
are associated with other words via an algorithm. The extracted words and associated words are 
used as a search index for the document. 

[0011] U.S. PatentNo. 5,996,011 to Humes discloses a computer based system and method 

for filtering data received by a computer system, and in particular, for filtering text data from World 
Wide Web pages received by a computer connected to the Intemet, for purposes of restricting access 
to objectionable web sites. 

[0012] U.S. PatentNo. 6, 148,342 to Ho discloses a system for managing sensitive data. The 

system prevents a system administrator from accessing sensitive data by storing data and identifier 
information on different computer systems. Each query from a user's terminal is encrypted using 
two codes, the first code readable only by an identifier database and a second code readable only by 
a data access database. The data is routed from the user's source terminal to the identifier database 
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at the first computer. The first computer/identifier database first verifies the user's ID and the 
security clearance for the requested information and substitutes a second internal ID to the data 
packet/query. The modified query is then presented to the data access database (the second 
computer) and, subject to a second security clearance, the response to the data query is sent back to 
the user's source terminal. 

[0013] A publication entitled "Element- Wise XML Encryption" by H. Maruyama T. 

Imamura, published by IBM Research, Tokyo Research Laboratory, April 20, 2000 discloses a 
protocol or process wherein certain parts of an XML document are encrypted and the balance of the 
plaintext is not encrypted. The protocol is usefiil in three party transactions, for example, when a 
buyer sends an order in an XML document to a merchant which contains the buyer's credit card 
information. The credit card information is sent to a credit company and the merchant does not need 
to know the credit number as long as he obtains clearance or authorization from the credit card 
company. Another instance is an access control policy which requires a certain part of an XML 
document to be readable only by a privileged user (for example, a manager could access the salary 
field in an employee records but others could only access name, phone and office fields). The 
Imamura article discusses encryption protocol, the delivery of keys and the utilization of 
compression. The article does not discuss separate storage of the critical data apart from the 
plaintext of the XML document. 
* [0014] The Ingrain ilOO Content Security Appliance product brochure, available in June, 

2001, discloses a system coupled to multiple web servers (computers) to accelerate secured 
transactions between multiple client computers (over the Internet) and prevents Secure Sockets Layer 
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SSL performance bottlenecks by performing high-performance SSL handshakes and encrypting all 
data sent to back end servers using long-lived SSL session. 

[0015] An article entitled "Survivable Information Storage Systems" by J. Wylie M, Bigrigg, 

J. Strunk, G. Ganger, H. Kiliccote, and P. Khosla, published August, 2000 in COMPUTER, pp. 61- 
67, discloses a PASIS architecture which combines decentralized storage system technologies, data 
redundancy and encoding and dynamic self-maintenance to create survivable information storage. 
The Bigrigg article states that to achieve survivability, storage systems must be decentralized and 
must spread information among independent storage nodes. The decentralized storage systems 
partition information among nodes using data distribution and redundancy schemes commonly 
associated v^^ith disc array system such as RAID (redundancy array of hidependent discs) insuring 
scalable performance for tolerance. P. 61. Thresholding schemes - also known as secret sharing 
schemes or information disbursal protocols - offer an alternative to these approaches which provide 
both information confidentiality and availability. These schemes and codes, replicate, and divide 
information to multiple pieces or shares that can be stored at different storage nodes. The system 
can only reconstruct the information when enough shares are available, P. 62. The PASIS 
architecture combines decentralized storage systems, data redundancy and encoding and dynamic 
self-maintenance to achieve survivable information storage. The PASIS system uses threshold 
schemes to spread information across a decentralized collection of storage nodes. Client-side agents 
communicate with the collection of storage node to read and write information, hiding 
decentralization from the client system. P. 62. The device maintains unscrubable audit logs —that 
is, they cannot be erased by client-side intruders — security personal can use the logs to partially 
identify the propagation of intruder-tainted information around the system. P. 63. The article states 
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that, as with any distributed storage system, PASIS requires a mechanism that translates object 
names - for example file names ~ to storage locations. A directory service maps the names of 
information objects stored in a PASIS system to the names of the shares that comprised the 
information object. A share's name has two parts: the name of the storage node on which the share 
is located and the local name of the share on the storage node. A PASIS file system can embed the 
information needed for this translation in directory entries. P,63. To service a read request, the 
PASIS call client (a) looks up in the directory service the names of the n shares that comprise the 
object; (b) sends read requests to at least m of the n storage nodes; (c) collects the responses and 
continues to collect the responses until the client has collected m distinct shares; and (d) performs 
the appropriate threshold operation on the received shares to reconstruct the original information. 
P. 63. The p-m-n general threshold scheme breaks information into n shares so that (a) every 
shareholder has one of the n shares; (b) any m of the shareholders can reconstruct the information; 
and (c) a group of fewer than p shareholders gains no information. P. 64. Secret-sharing schemes 
are m-m-n threshold schemes that trade off information confidentiality and information availability: 
the higher the confidentiality guaranty, the more shares are required to reconstruct the original 
information object. Secret sharing schemes can be' thought of as a combination of splitting and 
replication techniques. P. 64. The article discusses the technique of decimation which divides 
information objects into n pieces and stores each piece separately. Decimation decreases information 
availability because all shares must be available. It offers no information theoretic confidentiality 
because each share expresses 1/n of the original information. P. 64. Short secret sharing encrypts 
the original information with a random key, stores the encryption key using secret sharing, and stores 
the encrypted information using information disbursal. P. 64, An extension to the threshold 
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schemes is cheater detection. In a threshold scheme that provides cheater detection, shares are 
constructed in such a fashion that a chent reconstructing the original information object can tell, with 
high probability, whether any shares have been modified. This technique allows strong information 
integrity guarantees. Cheater detection can also be implemented using cryptographic techniques such 
as adding digest to information before storing it. P, 65. For the highest architecture to be effective 
as possible, it must make the full flexibility of threshold schemes available to clients. The article 
believes this option requires automated selection of appropriate threshold schemes on a per object 
basis. This selection would combine object characteristics and observations about the current system 
environment. For example, a client would use short secret sharing protocol to store an object larger 
than a particular size and conventional secret sharing protocol to store smaller objects. The size that 
determines which threshold scheme to use could be a function of object type, current system 
performance, or both. P. 67. 

Objects of the Invention 

[0016] It is an object of the present invention to provide a data security system, an 

information processing system and a method for securely storing data and rebuilding that data in the 
presence of an adequate security clearance. 

[0017] It is another object of the present invention to provide a method for securing data on 

a single personal computer (PC), on a plurality of computers linked together through a local area 
network (LAN) or a wide area Network (WAN). 

[0018] It is a further object of the present invention to provide a method for securing data 

utilizing a client-server computer system. The client-server computer system may be implemented 
over the Internet. 
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[0019] It is a further object of the present invention to provide a method for securing data 

which is highly flexible and programmable by a user. 

[0020] It is an additional obj ect of the present invention to enable the user to establish (a) the 

scope of the security sensitive words, characters or icons, (b) the future use (or destruction) of a filter 
enabling extraction of security sensitive data, (c) the selection of storage locations for extracted data 
and remainder or common data and (d) one or multiple levels of security limiting full reconstruction 
and enabling partial reconstruction of the extracted data and the remainder or common data. 
[0021] These steps may be completely automated (after some parameters are set or 

programmed by a system administrator), may be fully programmable by the user, or may be a 
combination of automated and manual controls implemented by the systems administrator and/or 
the user-client. 

Summary of the Invention 
[0022] The method for securing data in a computer system includes establishing a group of 

security sensitive words, characters or icons, filtering the data input from a data input device and 
extracting the security sensitive data. The extracted data is separated from the remainder data and 
is separately stored. In one embodiment on a personal computer (PC) system, the extracted data and 
the remainder or common data is stored in different, distributed memory segments. In a network 
implementation, the extracted data may be stored in one computer and the remainder or common 
data may be stored in another computer. In a client-server implementation, the server may direct 
storage of the extracted data to a different location than the remainder data, either on the server or 
on a further memory system (computer) interconnected to the server or on the client computer and 
in distributed memory segments, A map may be generated by a software module or sub-system 
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indicating the location of the extracted data and the remainder data in the network. The filter may 
be destroyed (via a deletion routine) or may be retained for future use by the user. The map may be 
stored on the client computer or the user's PC or may be stored on the server. Copies of the map 
may be removed (deleted) from the user's PC or the client computer. The extracted data and/or the 
remainder data may be removed (deleted or scrubbed) from the originating computer Encryption 
can be utilized to further enhance the security levels of the system. All transfers of the filter between 
the client to the server may be encrypted, and all data (whether extracted data or remainder data) may 
be encrypted prior to storage in the distributed memory. Any transfer of extracted data or remainder 
data or maps or filters may include an encryption feature. Reconstruction of the data is permitted 
only in the presence of a predetermined security clearance. A plurality of security clearances might 
be required which would enable a corresponding plurality of reconstructing users to view all or 
portions of the data. Persons with low level security clearance would only be permitted to have 
access to low level extracted data (low level security sensitive data) and the common data. Persons 
with high level security clearances would be permitted access to the entire document reconstituted 
from the extracted data and the remainder data. A computer readable medium containing 
progranoming instructions carrying out the methodology for securing data is also described herein. 
An information processing system for securing data is also described. 

[0023] The present invention can be configured in various forms. The following descriptions 

discuss various aspects of the invention and further advantages thereof. 

[0024] The present invention enables automatic classification and declassification of 

documents on the fly. The extraction process downgrades and declassifies documents on the fly (in 
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real time) so that they are useless to unauthorized parties. Presentation by a user of a valid security 
clearance enables substantially instant and seamless reconstitution of the security sensitive content, 
[0025] The present invention automatically secures unstructured documents and freeform 

documents for example, e-mail, instant messaging, or Word documents (input documents). 
[0026] The present invention automatically secures structured documents and transactional 

documents for example, database records or XML documents (input documents). 
[0027] The present invention introduces flexibility into security management, risk 

management of data, data storage, and data flows and enable automatic responsiveness to threats. 
The innovation enables automatic response to security challenges and threats. The innovation can 
maintain, upgrade and downgrade the levels of security through implementation of a leveled granular 
extraction process and a controUed-release mechanism. Attacks or other external events can trigger 
a response in the form of higher extraction levels, expanding the type of content extracted, and 
constricting the release of important and critical data control from storage. How much and what to 
extract depends on the level of threat or perceived risk. In same manner, the amount and type of 
content released from storage and reconstituted depends on the level of threat or risk perceived by 
the system. The system delivers a level of security protection specifically matched to meet security 
needs as dictated by the changing security threats, environment, policy and organizational needs. 
(0028] The present invention introduces multiple levels and standards of security. It is 

common knowledge that the highest security is delivered through total separation. Whereas this 
concept has only been implemented physically or by isolating computer environments, the invention 
achieves this concept of total separation within open and networked computer environments. The 
invention can implement a total physical and logical separation of important and critical data from 
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its context and can preclude access to that information without a needed granular access permission. 
The invention is also effective for sounds and images (data objects as security words, characters, 
terms or icons). 

[0029] The present invention introduces a new method and apparatus to monitor security 

sensitive content through a process of analysis and categorization of each word or character, in a 
document. The invention enables processing of every character, word, number, as they are entered 
into a document and categorizes each into one of many pre- set categories. Categories can include 
surnames, locations, currency, defined terminology, and unknovm words or phrases. 
[0030] The present invention introduces a method and apparatus for plain text extraction and 

dispersion of security sensitive data. Maximum security with traditional methods encumbers free 
flow of information and business efficiency. Encryption burdens computer systems with high 
performance overhead, and its use is limited to the parties who have decryption capabilities. The 
invention offers a new solution. It enables leveled security in plain-text format, in addition to none, 
some, or all of pre-existing encryption, decryption, firewalls, and other security infi:astructure. The 
level of security is determined by the extent of the security sensitive items, selection process; the 
extent of dispersal to various distributed storage locations; the rules for controUed-release from 
storage; and the access rules governing the reconstitution of extracts into the secured document. 
[0031] The extractions are dispersed to distributed storage on a granular level. The rest of 

the document can be stored at its original location and/or other storage locations. Dispersal of 
extractions introduces new barriers not existing in current security. In certain situations, an attacker 
has first to find the (encrypted) map to the locations, then locate and access the distributed storage, 
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get the data released from the controlled-reiease storage, and finally reintegrate the extracts into the 
appropriate documents. 

[0032 ] The present invention introduces a method and apparatus for targeted extraction and 

encryption of security sensitive items. The extraction capabilities of the system enable different 
workflow modes. The system enables extraction and encryption of important and critical content. 
In essence, only the critical content is extracted and/or encrypted, whereas the rest of the document 
remains as plaintext. This capability enables the following: advantages and flexibility; and the ability 
to share the document within the organization or transmit it to outsiders while still maintaining 
security over the most important and critical content of the document. This is an automatic process 
for controlling the content of outgoing e-mail The document owner releases the important and 
critical content by enabling access to it to defined parties at defined times within defined threat 
modes. 

[0033] The present invention introduces a method and apparatus for encrypting document 

or extractions with multiple encryption types. The invention can deliver the highest level of security 
by using multiple types of encryption (and/or multiple keys) for one line, paragraph or document. 
Maximum security is delivered through automatic selection of security sensitive items, and 
encrypting these extractions with one or more types of encryption. The remainder data can also be 
encrypted. Multiple encryption types within one document statistically precludes deciphering that 
document regardless of the available computer power. Common encryption methods are vulnerable 
through existing technologies, social engineering methods, carelessness, and workflow habits. 
Furthermore, simple encryption becomes more vulnerable (including triple DES) assuming future 
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mathematical breakthroughs or quantum computing. Existing methods to crack block ciphers are 
being improved to compromise the future AES Rinjdael standard. 

[0034] The present invention to introduce a method and apparatus for content dispersion. 

The innovation enables control over specific important and critical content items within the general 
contents of documents or digital files in a computer or within a network. The immediate controlled- 
release of those important content items according to specific identification and access criteria 
proactively safeguards the security and the value of documents or digital files. The content control 
enables broad dissemination of the digital files in closed networks, as well as open networks 
including the Internet, without compromising the security of the important and critical mformation 
in the digital file. The dispersal channels can include any of all of the following: networks, Internet, 
Virtual Private Channel. Telephone lines, Optical lines, Wireless, Fax, Documents, Verbal 
communication. 

[0035] The present invention introduces a method and apparatus for enhancing the 

survivability capabilities of an organization and its networks. If networks get damaged, the 
decryption capability, such as PKI, is likely to be compromised, or at a minimum, suspended. In 
such instances, the invention enables continuation of work on channels, which need not be secure. 
In addition, the dispersion of information guarantees maximum partial reconstitution to documents 
and transactions, or total reconstitution to documents and transactions benefiting fi-om backup at 
distributed storage facilities. 

[0036] The present invention inttoduces a method and apparatus for delivering security for 

inter-connecting networks. It enables security for closed networks connecting to the Internet and 
other open networks. The Internet infrastructure and open networks are not secure. Even secured 
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closed networks, such as VPNs, are not secured enough. The critical content of documents is the 
critical asset of the organization and must be highly secured, with maximum reliability, full 
transparency and instant accessibility. To remain competitive, organizations must maximize utility 
of the critical data within their documents, files, databases and servers. The securing of such 
documents must not be at the expense of compromising the access or processing speed of such 
documents. The invention enables work in plain text, as well as with encryption. Working in plain 
text reduces the computing performance overload. 

[0037] The present invention introduces a method and apparatus for delivering information 

flow control in decentralized environments. Protection of privacy and confidentiality of information 
represents a long-standing challenge, The challenge has become much bigger with the expansion of 
the Internet, which has created decentralized networks. Parties, who do not know or trust each other, 
have to exchange information. The invention enables free flow and sharing of information between 
parties by removing burdening security restrictions and creating top security with a controUed- 
release of the security sensitive content in the documents. The technology enables top security 
through introduction of user and organization's ownership and control of the critical granular data 
in documents. 

[0038] The system introduces an additional layer of access controls at the granular level of 

the user document. In order to view the reconstructed critical information the user would need to be 
verified by additional access controls at the data storage level. The user access code or a physical 
key enables release of data from the storage. Today's access controls do not stop the user from 
distributing documents to other parties. The inventions fined grained controUed-release mechanism 
releases the critical information, only under a required set of circumstances and access validation, 
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The invention enables the user ownership of his security sensitive critical data and conditions for its 
release and dissemination. The user has the option to hide the critical data through declassification 
process and release through a reclassification process in which the critical data would be 
reconstituted in the document. 

[0039] The present invention introduces a method and apparatus for delivering 

Compartmentalization of security sensitive content by leveled access to users. The invention creates 
leveled sharing of information, for example such that persons with level 3 access will have keys for 
encryption type RS A persons with level access 2 will have access to Blowfish encryption within one 
document. 

[0040] The present invention introduces a method and apparatus for enabling more use of 

distributed and dispersed storage including ASPs (application service providers). There is a major 
human tendency to re&ain from sending important documents to web storage locations because of 
potential security breaches. This cultural issue is both driven by psychological issues and well- 
founded security concerns. The retention of those docxmients as is in physical proximity or locked 
security, provides actual security but precludes gaining any utility from those documents in a 
fimctional bxxsiness setting. Instead the invention enables functional distribution of those documents 
without the security sensitive data, and a controlled-release of some or all of the extractions in a 
granular way in order to support business activities while retaining security. 
[0041 ] The present invention introduces a method and apparatus for enabling lower storage 

costs. The extraction process declassifies and downgrades mission critical documents. The 
downgrading and transformation of a critical document into a non-critical document, enables storage 
in less secured and lower cost storage. Taking advantage of this security-initiated, extraction process 
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can yield substantial storage cost savings. The invention enables a high return on investment ROI 
for system storage cost arbitrage. Splitting the data into critical and non-critical enables 20 to 90% 
savings on storage cost. 

[0042] The present invention delivers an automated security risk management system that 

creates added in-depth security defenses at the semantic-level as well as creation of controlled- 
release mechanisms at the storage-level with significantly reduced performance overhead 
requirements. 

[0043] The present invention presents a technology which answers the security demands as 

required by Committee on Information Systems Trustworthiness of the National Research Council. 
The Committee's report. Trust in Cyberspace (1999), defines the security paradigms needed for a 
safe future. The report states: The substantial commercial off-the-shelf (COTS) makeup of a network 
information systems, the use of extensible components, the expectation of growth by accretion, and 
the likely absence of centralized control, trust, or authority demand a new approach to security: risk 
mitigation rather than risk avoidance; technologies to hinder attacks, rather than prevent them 
outright; add-on technologies and defense in depth; relocation of vulnerabilities rather than their 
elimination; none of the existing or security technologies addresses these needs in whole. The 
invention breakthroughs this barrier by providing a single system which implements each one of 
those four elements in a unified way. The invention controls information flow in centralized and 
decentralized environments, through controUed-release of information within distributed systems. 
[0044] The present invention implements security measures while accommodating the 

performance needs of a network. The invention provides a method and apparatus to ease overhead 
performance on congested computer networks. It can adjust the security defenses based on the 
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performance needs of the network. Many security systems overburden the already burdened 
computing environment in terms of computational overhead, labor, and training requirements. The 
invention enables to ease the overhead performance of a network by transforming from high 
overhead performance, encryption methods, and other security methods, to the method presented by 
this invention, 

[0045] The present invention minimizes the time of exposure of the important content within 

a document. The invention enables to separate the important content from the rest of the document 
for substantial periods of time, thereby minimizing substantially the exposure to risk. It is possible 
for example to extract the important content from the document and release it for reconstitution only 
when the user will open the document. In such situations the important content could for example 
be time and unexposed for over 99% of the time and exposed for less than 1% of the time, which 
lowers the risk substantially. 

[0046] The present invention provides a security risk management method and system to 

minimize security risks. The invention enables minimization of security risks by: Automatic 
separation and extraction of granular critical data from the core document. Dispersal of the extracted 
critical data groups to different secured storage locations. Reconstitution of the critical data in 
document for limited time, to minimize exposure to risk. Partial reconstitution, of the critical data, 
in core document, through a controlled release of granular critical data. Granular controlled release 
of data to specific authorized people only. 

[0047] The present invention provides a controlled release security mechanism to enable the 

release of content and granular content from storage locations in a centralized and decentralized 



20 



environment. The controlled release mechanism enables release of the appropriate content to the 
authorized party at the right time under the right circumstances. 

[0048] The present invention provides a security solution against damage by insiders. 

Studies show that insiders cause 70%-85% of the damage. These nine innovations are described in 
detail as follows: The invention enables insiders and employees to work with documents while 
managers and owners control the release of the critical prioritized information. The control is 
granular, thereby enabling continued work with the rest of the content in the document. The 
objective is to empower the user with the highest security while enabling him maximum sharing and 
delivery flexibility. This enables free flow of information between closed networks and public 
networks, such as the Internet, without compromising the security through extraction of important 
and critical content. The user can transport documents through various networks and e-mail services 
knowing that the critical information, which is still under control, and is not compromised. 
[0049] The present invention provides an automatic security system in order to overcome 

human flaws that create security vulnerabilities. Human engineering flaws are the cause of 90% of 
security vulnerabilities. For example, passwords are exposed through human fault enabling reading 
of plain text before it is encrypted. The invention enables an automatic process of appropriate 
response to security threats in an objective way and on an on going basis. 

[0050] The present invention provides an automatic security system in order to reduce human 

labor, and training costs. 

[0051] The present invention provides protection for important granular content within a 

document. A feature lefl: out in computer development is the protection and automatic protection of 
granular important content in a document. In every facet of life critical assets are immediately 
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protected. For example, credit cards and cash are protected in a wallet, important items at home are 
placed in closets, wall units, cabinets and safes. 

[0052] The present invention provides an alternative method to encryption. Mathematical 

security and encryption could be broken. Discovery of a mathematical equation for a shortcut of the 
factoring of prime numbers would be make mathematical security and encryption extremely 
vulnerable. 

[0053] In 1999 a 512-bit RSA key was broken- at that time 95% of keys in e-commerce were 512 

bits long. U.S. government 56-bit Data Encryption Standard was cracked in just 22 hours by the 
Freedom Foundation. 1 00, 000 PCs were connected with a supercomputer which enabled the testing 
of 245 billion keys per second. 

[0054] The invention provides an automated security risk management system. The system 

automatically responds to attacks by matching the defenses level to the level of threats The system 
responds to security threats through the following mechanisms: ( 1 ) controlled extraction of sensitive 
security data: in normal circumstances, extractions will take place according to pre-set rules; in threat 
situations, additional extractions will take place to deliver higher security; in an attack, additional 
substantial amounts of critical data will be extracted to deliver the highest security; (2) controlled 
dispersal to storage locations; in normal circumstances, dispersal to different storage locations 
according to pre-set rules will take place; in threat and attack situations, more dispersal to more 
storage locations, via additional communication channels will take place; and (3) controlled release 
of extracts for reconstitution; controlling amount of extracts released for reconstitution; controlling 
time of exposure of extracts in reconstitution; limiting access to specific people; and limiting access 
to specific times, 
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[0055] The present invention defends against devices like keyboard sniffers and mouse 

sniffers that can read information keyed into the computer and transmit it to an adversary. The 
invention enables to input security sensitive items through data input devices other than the 
keyboard. For example credit card numbers can be inputted through a hand held wireless devise. The 
inputted data would be transferred to storage for possible reconstitution. 

[0056] The present invention defends against as devices that intercept electromagnetic 

signals from computers, monitors, printers, and keyboards. For example the Van Eck receptors 
which can read information off the screen the display screen. The invention enables separation 
contents of document into two or more displays thereby limiting the potential damage of 
electromagnetic eavesdropping. 

Brief Description of the Drawings 
[0057] Further objects and advantages of the present invention can be found in the detailed 

description of the preferred embodiments when taken in conjunction with the accompanying 
drawings in which: 

[0058] FIG. 1 A diagrammatically illustrates a basic system diagram showing filtering and 

storing extracted data and remainder or common data and, in an enhanced embodiment, generating 
and storing a map. 

[0059] FIG, 1 B diagrammatically illustrates a system diagram showing reconstruction of the 

data, various security clearances and both electronic reconstruction and visual reconstruction. 
[0060] FIG. 2 diagrammatically illustrates a system showing major components of a single 

personal computer (PC) system, a networked system with several PCs (a LAN or WAN) and the 
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network coupled to a telecommunications system and the Internet and shows the interconnection 
with a server and multiple, Internet-connected memory units. 

[0061] FIG, 3 diagrammatically illustrates a basic flowchart showing reconstruction for 

various security levels. 

[0062] FIG. 3 A diagrammatically illustrates interleaving district data into different memory 

locations in a video memory. 

[0063] FIG. 4 diagranunatically illustrates a flowchart showing one embodiment of the 

principal portions of the data security program. 

[0064] FIG. 5 diagrammatically illustrates a flowchart showing the basic elements of the 

reconstruction process for the data security program. 

Detailed Description of the Preferred Embodiments 
[0065] The present invention relates to a data security system, a methodology of securing 

data on a personal computer (PC) system, on a computer network (LAN or WAN) and over the 
Internet and computer programs and computer modules and an information processing system to 
accomplish this security system. 

[0066] It is important to know that the embodiments illustrated herein and described herein 

below are only examples of the many advantageous uses of the innovative teachings set forth herein. 
In general, statements made in the specification of the present application do not necessarily limit 
any of the various claimed inventions. Moreover, some statements may apply to some inventive 
features but not to others. In general, unless otherwise indicated, singular elements may be in the 
plural and vice versa with no loss of generality. In the drawings, like numerals refer to like parts or 
features throughout the several views. 
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[0067] The present invention could be produced in hardware or software, or in a combination 

of hardware and software, and these implementations would be known to one of ordinary skill in the 
art. The system, or method, according to the inventive principles as disclosed in connection with the 
preferred embodiment, may be produced in a single computer system having separate elements or 
means for performing the individual functions or steps described or claimed or one or more elements 
or means combining the performance of any of the functions or steps disclosed or claimed, or may 
be arranged in a distributed computer system, interconnected by any suitable means as would be 
known by one of ordinary skill in the art. 

[0068] According to the inventive principles as disclosed in connection with the preferred 

embodiment, the invention and the inventive principles are not limited to any particular kind of 
computer system but may be used with any general. purpose computer, as would be known to one 
of ordinary skill in the art, arranged to perform the functions described and the method steps 
described. The operations of such a computer, as described above, may be according to a computer 
program contained on a medium for use in the operation or control of the computer as would be 
known to one of ordinary skill in the art. The computer medium which may be used to hold or 
contain the computer program product, may be a fixture of the computer such as an embedded 
memory or may be on a transportable medium such as a disk, as would be known to one of ordinary 
skill in the art. 

[0069] The invention is not limited to any particular computer program or logic or language, 

or instruction but may be practiced with any such suitable program, logic or language, or instructions 
as would be known to one of ordinary skill in the art. Without limiting the principles of the 
disclosed invention any such computing system can include, inter alia, at least a computer readable 
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medium allowing a computer to read data, instructions, messages or message packets, and other 
computer readable information from the computer readable medium. The computer readable 
medium may include non-volatile memory, such as ROM, flash memory, floppy disk, disk drive 
memory, CD-ROM, and other permanent storage. Additionally, a computer readable medium may 
include, for example, volatile storage such as RAM, buffers, cache memory, and network circuits. 
[0070] Furthermore, the computer readable medium may include computer readable 

information in a transitory state medium such as a network link and/or a network interface, including 
a wired network or a wireless network, that allow a computer to read such computer readable 
information. 

[0071] In the drawings, and sometimes in the specification, reference is made to certain 

abbreviations. The following Abbreviations Table provides a correspondence between the 
abbreviations and the item or feature. 

Abbreviations Table 



A-com 



computer or memory store for common or remainder data 



B-ext 



computer or memory store for extracted data 



bd 



board 



CD-RW 



compact disk drive with read/write feature for CD disk 



CPU 



central processing unit 



doc 



document 



dr 



drive, e.g., computer hard drive 



e 



encryption 



ext-data 



extracted data 
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1/(J 


input/output 


I-com 


Internet storage for common or remainder data 




iniemei siorage lor exiracteci aaia 


loc 


location 


mem 


memory 






icl 


iCiCaoC 


rccj 


rcqucsx 


sec 


security 


sys 


system 


t 


time 


tele-com 


telecommunications system or network 



[0072] FIG. 1 A diagrammatically illustrates the basic processes for establishing a secure 

storage of information, generally identified herein as "data." "Data," as used herein, includes any 
data object, e.g., text, images, icons, etc. Sound bites and video images may also be extracted data. 
A source document 1 00, sometimes referred to as a "plaintext," is passed through a filter 1 02. Filter 
102, in a most basic sense, separates out common text or remainder data 1 04 from uncommon text, 
words, characters or icons. The security sensitive words, characters or icons are separated from 
remainder or common text 104 as extracted text 106. It should be noted that although the word 
"text" is utilized with respect to remainder text 104 and extracted text 106, the text is a data object 
and includes words, phrases, paragraphs, single characters, portions of words, characters, whole or 
partial images, or icons. In a basic implementation, filter 102 may utilize a dictionary such that 
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words present in the dictionary (common words) are separated from the source plaintext document 
100 and placed into remainder document or common data file 104. The uncommon words 
(extracted-security sensitive words), not found in the dictionary, would be placed in an extracted text 
or extracted data file 106. For example, a business may wish to impose a security system on a 
contract document such that the names of the contracting parties (not found in the dictionary) and 
the street names (not found in the dictionary) would be stored in extracted data text file 106. The 
common text or remainder data would be stored in remainder data file 104. In the illustrated 
embodiment, remainder data file 104 also includes place holders which enables the extracted data 
to be easily inserted or set back into the remainder data file. 

[0073] The security sensitive words, characters or icons may be any word, phrase, letter, 

character, icon (full or partial), image or whatever as pre-defined or as established by the user. The 
user may specifically design the filter, begin with a dictionary to define common terms, identify any 
additional security sensitive words, letters, images, icons, partial versions of the foregoing or any 
other granular aspect of the plaintext. After defining the filter and accepting the data input, the 
system filters the plaintext and separates extracted data (security sensitive items) firom the remainder 
data. 

[0074] In a basic configuration, the common text or the remainder data is stored in common 

storage memory 108. This common or remainder data store is identified as A-com generally 
referring to a segmented memory in a PC or a computer A in a network (LAN or'WAN), Remainder 
data storage 108 may include a confirm storage signal function 1 1 1 to send back a confirm storage 
signal to the data input device generating source plaintext document 1 00. The extracted data file 1 06 
is stored in a different memory computer storage 1 10 (B-ext). In a preferred embodiment, memory 



28 



segment 1 08 (A-com) is at a different location than computer storage memory segment 1 1 0 (B-ext). 
In a PC embodiment, memory A-com is a different memory segment than memory B-ext. In a 
networked embodiment, computer storage 108 may" be on a different computer as compared with 
computer storage 110. In any event, the remainder text is stored in a memory A-com and the 
extracted data or high security words, characters or icons are stored in memory B-ext. After storage 
of the extracted data in memory 110, a confirmation indicator 1 13 may be generated to the client 
computer or the computer handling source plaintext input document 100 (the originating computer 
system). 

[0075] In a further enhancement of the present invention, the computer or data input device 

handling source plaintext document 100 may also record the location of A-com 108 and B-ext 110. 
The location data is called herein "map." A memory mapping function is utilized. The map may 
be stored in a third memory location 112. Memory location map 112 may be a segment of the 
memory of the data input computer originating plaintext 100. 

[0076] As a further enhancement of the present invention, the user, prior to initiating the 

security system, may be given a choice of filtering out all the uncommon words or words not found 
in the dictionary and adding certain security sensitive words, characters or icons to filter 102. The 
added words or terms are filtered out with the uncommon words. Of course, the user may be 
required to manually input all security words or download the security word filter from the Internet 
or another system on the LAN. For security systems having multiple security levels, a plurality of 
filters would be created, each filter associated with a different security level. Further, multiple 
security levels would require, in addition to remainder text document or data 104, a plurality of 
extracted data documents 1 06. The common or remainder text document or data 1 04 would still be 
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stored in remainder computer storage A-com 108. However, each extracted data document 106 
would be stored in a respective, separate computer memory segment or computer B-ext 110. 
Separate storage of a plurality of extracted data at multiple, separate locations in B-ext is one of the 
many important features of the present invention. 

[0077] In view of increasing levels of security relating to (a) the storage location A-com; (b) 

the transfer of remainder text document 1 04 to memory computer storage A-com 1 08; (c) the storage 
of map 112; (d) the creation, storage or transfer of filter 102; (e) the storage of extracted data at 
memory storage B-ext (whether singular or plural storage sites); and (f) the transfer of extracted data 
thereto, the system may include an encryption e feature. The encryption e fimction 1 1 5, 1 1 7 and 1 1 8 
is diagrammatically illustrated in FIG. lA. 

[0078] FIG. 1 B generally diagrammatically illustrates the maj or features of a reconstruction 

routine or system. The user, typically at a computer termuial, inputs a reconstruction request 120. 
The system first executes a security clearance protocol routine 1 22 in order to determine whether the 
user has the proper security clearance. The security clearance may be thought of as a security 
clearance control. If multiple users are permitted access to the documents and those multiple users 
have different security clearances, the security clearance protocol determines the level of security 
clearance and, hence, the full or partial reconstruction of the plaintext. The security code input by 
the user is checked against a security code database or list 124. Clearance is provided in step 126. 
The location of the map and, hence, the location of the remainder data A-coni 108 and extraction 
is provided to the user's computer in step 128. This may include obtaining a copy of the map 130 
showing the location of memory segments in (a) the local computer; (b) the LAN or WAN; or (c) 
the Internet storage sites. The storage segments are A-com 108 and B-ext 110. The common or 
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remainder data is downloaded or transferred or made available to the user's computer as shown at 
the output of map location and data step 128. Typically, the extracted or security sensitive data from 
B-ext is downloaded. As described hereinafter, the data can be reconstructed as a complete 
electronic document in function 130 or may be reconstructed only as a visual reconstruction in step 
132. Visual reconstruction is discussed later. Function 130 operates as a compiler to gather the 
extracted data and remainder data into a single plaintext document. 

[0079] If remainder data in A-com memory 108 and extracted data in B-ext computer 

memory 1 10 is encrypted, the reconstruction process includes a decryption step. Encryptors and 
decryptors are relatively well known by persons of ordinary skill in the art. Further, the filter 102 
(FIG. 1 A) may include some encryption routine operating on the data object (plaintext) during the 
filtering. A simple encryption may include substituting "dummy" text or images for the security 
words and keeping a pointer to an encryption key document mapping tiie security words with the 
dummy words. The filter may be stored or may be destroyed at the option of the user. Storage of 
the filter impacts the degree of security of the entire data system but storage of the same filter enables 
the user to reuse the filter at a later time. Encryption of tiie stored filter increases the security of the 
data. Creation and storage of map in memory 1 12 also impacts the degree of security of the system. 
However, if the filter 102 is desti-oyed and all copies of tiie map are destroyed on the user's computer 
originating plaintext document data 100, and the map is stored offsite in a third computer memory 
location 1 1 2, tiiis offsite map storage may enhance the degree of security of tiie data. The originating 
computer processing plaintext 100 may be scrubbed to remove all reference and copies of tiie 
plaintext, remainder text, extracted data map storage data, etc., i.e., a deletion routine may be 
employed on tiie data input computer. 
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[0080] FIG. 2 diagrammatically illustrates a personal computer or PC computer system 140, 

a second PC or computer 142, and a third PC -3. PCs 140, 142andPC-3 are connected together via 
a network 145(LAN or WAN) and are also connected to an input/output device 146 that may be 
generally described as a router or a server to an outside communications system. The input/output 
device 146 is connected to a telecommunications system 148 which leads to Internet 150. The 
Intemet is a global computer network. Internet 1 50 is coupled to a plurality of servers, one of which 
is server 152. Server 152 may be designated as an application service processor ASP. Intemet 150 
also includes various computer memory storage devices such as computer storage I-com 154, 
computer storage I-ext 156 and computer storage map 158. Computer storage enabling the store of 
extracted data includes a security level clearance module 157. Similarly, map computer storage 1 58 
includes security level clearance module 159. 

[0081] As stated earlier, the present data security system can be implemented on a single 

personal computer 140. In this case, different memory segments or hard drive 168 may be used for 
A-com and B-ext. Typically, PCs include a keyboard or data input device 161, a display 163, a 
central processing unit CPU 165, a video board 167 having video board memory 169, a fixed disc 
hard drive 168, a RAM 166, and input/output device 164, a removable memory media floppy drive 
162 and a removable compact disk (CD) read-write (CD-RW) device or drive 160. The system may 
include other removable disk drives, tape drives, or flash memory units. Internal units CPU 165, 
video board 1 67, hard drive 168, RAM 1 66 input/output device 1 64, floppy drive 1 62 and CD-ROM 
device 160 are all coupled together via an internal bus 171 . Bus 171 represents a plurality of buses 
as is known to persons of ordinary skill in the art. 
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[0082] One methodology of implementing the present invention utilizes distinct memory 

segments which may be designated in one or more of the following: hard drive 168, memory in a 
removable disk in floppy drive 162, memory in a removable CD disc in CD-RW device 160, and, 
to a very limited extend, RAM 166, In this manner, the user may select, generally at the outset of 
the process, that the extracted data memory storage B-ext 1 10 be stored on a floppy (removable 
memory) via floppy drive 1 62 or a CD via CD-RW drive 1 60. The user can then simply remove the 
floppy or the CD and carry it with him or her. To reconstruct the data, the operative program, 
generally discussed above would have access to the floppy or the CD and particularly the memory 
location of the data on the floppy and the CD in order to reconstruct the entire plaintext document 
100 (see FIG. 1 A). Alternatively, different portions of hard drive 168 may store A-com and B-ext. 
Of course, the computer system may utilize tape drives and memories or flash card, programmable 
memory, 

[0083] In a local area network or Avide area network implementation, PC 142 includes 

memory similar to memory imits described in PC 140 and a memory segment may be set aside in 
PC 142 separate from the common data or remainder data storage typically placed on hard drive 168 
in PC 140. As a further expansion of the present invention, the extracted data (that is, the high 
security data), may be stored on computer storage I-ext memory unit 156 via Internet 150, 
telecommunications system 148 and router/server 146. In this manner, the common data or 
remainder data is stored on hard drive 1 68 and the highly sensitive data is stored off site in a secured 
location. Access to that secured location may be limited via security layer 157. If the user 
implements an encryption system (see encryption e 1 18 in FIG. 1 A), the extracted data is further 
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secured by the encryption during the transfer from computer 140 through network 145, router/server 
146, telecommunication system 148, Internet 150 and ultimately to computer storage I-ext 156. 
[0084] The present invention may also be embodied utilizing an Application Service 

Provider on server 152 and in a client-server network. 

[0085] An implementation of the present invention over Internet 150 most likely includes 

the use of a uniform research locator or URL for map memory computer 158, computer storage I-ext 
156, computer storage I-com 158 and ASP server 152, In a client-server environment, server 152 
acts as a server generally commanding the operation of client computer 140, Of course, persons of 
ordinary skill in the art recognize that the server may .be located on the local area network 145 rather 
than being interconnected with Internet 150 as shown in FIG. 2. The claims appended hereto are 
meant to cover the alternative embodiments, 

[0086] As an example of a client-server or web-based implementation of the present 

invention, the user at computer 140 may define the filter 102 as described above, and input data 
(plaintext) via keyboard 161 or load plaintext data from floppy drive 162 or CD-ROM drive 160 
into RAM 166. In any event, whether the plaintext data is input via keyboard 161 or copied or 
accessed in from floppy drive 162 or CD-RW drive 160, the plaintext data is filtered as discussed 
above in connection with FIG. 1 A. Prior to filtering, it would be appropriate for the user at computer 
140 to identify where the remainder data or common data will be stored and where the extracted or 
high security data would be stored. The system is sufficiently flexible to enatle the user to select 
local storage on different memory segments of PC 140 (hard drive 168, floppy drive 162, CD-RW 
drive 160) or be flexible enough to enable user at computer 140 to designate off site storage of the 
high security data (extracted data) and/or the common or remainder data. The off site data storage 
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process may include activating server 152 and enabling the server to take over the process directly 
from user 140. In other words, the user at computer 140 could call up the URL of the server 152, 
the server could request certain user information (user name, password), and would request data from 
the client computer to establish the filter pursuant to input selected by the user. The client compute 
may (a) filter the plaintext thereat or (b) send the data to the server for filtering. The server could 
store data either locally on computer 140 or remotely at computer memories 1 54, 1 56. After storage 
of the data at any of these locations, the server 1 52 may establish a map and store the map in memory 
location 158, Of course, remainder data and the map may be stored at ASP 1 52 or client computer 
140. The map, if stored at map storage 158, may be downloaded to the user at computer 140, The 
filter may be stored at computer 140 or may be stored at a secured location on server 152, 
Alternatively, the map could be destroyed on user computer 140. The filter could also be destroyed 
on user computer 140. Of course, the filter could be stored in a fourth remote location (not shown), 
different from I-com 154, I-ext 156 and map computer memory 158, Storage of the map and 
decryption keys is a critical, high security task. Appropriate security measures should be utilized to 
protect those items. Local removable memory storage on disc in floppy drive 1 62 or disc in CD-RW 
160 may be reasonable. All traces of the map, the filter, the encryption key, the extracted data, and 
possibly the remainder data may be scrubbed or deleted from all computer memories (by write-over 
or disc reformat routines) other than the "com" and "exf storage sites. Deletion of all URLs, links, 
etc, is also recommended for high security applications. Deletion systems are known to persons of 
ordinary skill in the art. 

[0087] To reconstruct the document, the user at computer 140 would be required to call up 

the URL of server 152 and input the appropriate security code. The server 152 would then call up 
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and download data from various memory locations whether they be memory locations on computer 
140 or memory locations I-com 154, 1-ext 156 and map memory 158. The system compiles the 
entirety of the plaintext document by gathering the dispersed components thereof or compiles partial 
reconstructions for different levels of security. By implementing different security levels, the system 
is dynamic enough such that server 1 52 can easily locate the various extracted data levels based upon 
various security codes representing different security levels, as those codes are input by the user at 
computer 1 40. Multiple security codes, at the inception and during the process, may be utilized. The 
user may be required to input security codes at multiple times during the reconstruction or 
compilation process. 

[0088] It should be noted that computer storage 154, 156 and 158 may be located on the 

same computer or may be located on different computers spread throughout the Internet. If the 
storage imits are different computers spread throughout the Internet, computer storage 154, 156 and 
158 would each have their own URL or Uniform Resource Locator. In any event, during 
reconstruction, the server 1 52 gathers the information and downloads the information into RAM 166 
of computer 140. This download may include a first download of the common or remainder data 
from I-com 154. At a separate time, which may or may not include a decryption routine, the 
extracted from I-ext 156 is downloaded. Preferably, other than inputting initial security codes and 
any required or desired intermediate security codes, the system operates automatically without 
further input from the operator at client computer 140. The download of both data sets may be 
simultaneous in that the download is not humanly perceivable. This is especially true if storage in 
different memory locations in PC 140 is utilized. 
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[0089] The role of server 1 52 may be expanded or reduced dependent upon the desires of the 

user and the degree of security necessary. For example, server 1 52 may only enable separate storage 
of extracted data in I-ext 156. In this limited role, server 152 w^ould require the input of a proper 
security code and clearance prior to identifying and enabling the download of extracted data from 
I-ext 156. 

[0090] In an expanded mode, server 1 52 may be involved in filtering the data, extracting the 

security sensitive words, characters or icons to obtain extracted data and remainder data thereat, 
separately storing the extracted data from the remainder data (extracted data being placed in 
computer memory I-ext 156 and remainder data being stored in common remainder data memory 
I-com 1 54) and then permitting reconstruction via separate or combined downloads of the remainder 
data and the extracted data into computer 140. 

[0091] FIG. 3 diagrammatically illustrates a system diagram for various reconstruction 

routines. A complete reconstruction is shown as security level path A. This involves an electronic 
integration of plaintext in step 202 resulting from the complete electronic reconstruction of document 
1 00. For example, a merge may occur between the extracted data and the remainder data or common 
text data. The document is completely compiled in this process. Placeholders in the remainder 
document are utilized to locate and insert the extracted data. Most likely, there will be no process 
controls imposed on the integrated document as shown in step 204. In other words, if the user at 
computer 140 has the proper security clearance, he or she could download or recreate the entire 
original source, plaintext document and the user would be entitled to edit the document or change 
it in any way or copy it and reproduce it. 



37 



[0092] The second level of security, path B, results in storage of the common or remainder 

data in a different memory location on the hard drive 1 68 as compared with the extracted data. This 
is noted in step 206. Another words, in a simple example, hard drive 168 or RAM 166 would hold 
a copy of a remainder data document and another copy of the extracted data document, that is, two 
separate documents. Since two documents are available in RAM 166 or hard drive 168, these 
documents are stored in different locations in the memory. In step 208, a map showing the memory 
location of the common or remainder document and the extracted data document is provided to 
computer 140. Step 210 commands the processor CPU 165 in computer 140 to interleave the 
extracted data with the common or remainder data in the video board memory. In this process, the 
extracted data would typically have placeholders for the missing remainder data. Otherwise, control 
codes to locate the extracted data into the remainder data would be executed by CPU 1 65 to properly 
place the extracted data into the "visual space" of the placeholders in the remainder data document. 
The extracted data document may have placeholder for the remainder data. Some type of register 
between the two image documents may be necessary. The compiler, in this embodiment, gathers the 
document elements and visually compiles and presents the plaintext to the user. 
[0093] FIG. 3 A diagrammaticaliy shows that video board memory 169 is loaded with 

remainder or common data 1 and a different location of the video memory is loaded with extracted 
data 1 . The next video memory location is loaded with common data 2 and then a different video 
memory location is loaded with extraction data 2. Since the refresh rate of computer monitor 163 
is fast, the display 1 63 will show the common or the remainder data and then show the extracted data 
such that the user could not humanly perceive a difference in the docxmient. However, the user could 
not copy the document from display screen 163 (a "screen shot") since the document is never 
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electronically integrated into a single document. There is only a visual presentation of the combined 
document by interleaving the extracted data with the common or remainder in the video memory 
169. Step 212 notes that the user may be limited in his or her ability to process, edit and store the 
reconstructed and presented plaintext document. 

[0094] Security level path C recognizes in step 214 that the data is stored m different 

memory or computer locations. In this situation, two video boards, video board A and video board 
B are shown as board 2 1 6 and 2 1 8 . Video board 2 1 6 drives display monitor 220. Video board 2 1 8 
drives display monitor 222. Display screens 220, 222 are overlaid atop each other. Video board 21 6 
is fed with common or remainder data from the remainder data store (see I-com store 154 in FIG. 
2) and video board 2 1 8 is fed with the extracted data from the extracted data store, for example, I-ext 
store 1 56. In this manner, as noted in step 224, the user is presented only with a visual presentation 
or compilation of the plaintext. Since there was physical separation between video monitor 222 and 
video monitor 220, there is no electronic integration at all of the plaintext document. Hence, the 
ability for the user to do any significant editing on the plaintext document is blocked or prohibited 
because the user only has access to either the data on video board 216 or the video board 218. 
[0095] Security level path D shows that the extracted data may be parsed or fiirther separated 

based on a plurality of security clearances in step 226. Step 228 recognizes that the system can 
repeat process and security process paths A, B and C only with portions of the extracted data 
presented to the user based upon the user's security clearance. 

[0096] FIG. 4 diagrammatically illustrates the major components of a flowchart for the data 

security program. It should be noted that this flowchart may be truncated to limit user selection of 
certain items. The system would be pre-set to contain these features. Step 230 mitializes the system. 
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Step 232 enables the user to designate various levels of security for the activity which he or she will 
soon engage. The system, in step 234, enables the user to define the levels of security parameters. 
The following Security Table gives some examples of the type of security that may be available to 
the user. 

Security Table 

to whom 
to where 

when (time of day, day of week, month, floating but predetermined 
time frame) 

why (purpose, match purpose to other security parameters or to 
certain predetermined* criteria) 

how (through what medium (LAN, WAN, Internet, direct dial link), 
download to what site or destination) 

how long (duration) the reconstruction process will be permitted per 
each security clearance level 

how much (different security levels enable reconstitution of 
documents and data with different amounts of secure data therein) 
timing systems may require synchronization for a standard clock (i.e., 
atomic clock) 

[0097] As an example of a truncated or pre-set program, a client-server system over the 

Internet may have URLs designating storage sites and an ASP 152 (FIG. 2) controlling storage. In 
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this pre-set system, the user does not select the sites. The sites may be randomly selected by ASP 
152. 

[0098] The user, in step 326 can designate the location of the filter, the common storage area 

for the remainder data, the extraction data storage and potentially multiple data storage areas or 
segments. Step 238 permits the user to engage or disengage encryption and, if engaged, establish 
the degree of encryption for the system. Step 240 enables the user to define the parameters of the 
filter. The user can retrieve a preexisting filter or may define a new filter for each data security 
session. These filters may consist of dictionaries or any type of compilation of words, characters, 
icons or pixel formation or any indication that can be perceived by the computer system. Granular 
extraction of data elements in a data object may be permitted. Step 242 recognizes that the user 
either inputs a preexisting plaintext document or types data into the system. In any event, the 
plaintext document is fed through the filter. Step 246 extracts the security data firom the input 
document. Step 248 stores the extracted data. The extracted data may be encrypted prior to storage. 
Step 250 conducts an error check on the extracted data. This error check is helpfiil in discerning 
problems in the storage of the data prior to closing down the data security system. Step 252 stores 
the common data or the remainder data. Step 254 conducts an error check on the common or 
remainder data. The decision step 256 determines whether the user has selected a "destroy filter" 
command. If not, the filter is stored with or without encryption in step 257. If YES, the filter is 
destroyed with a deletion routme. Typically, deletion is complete erasure of all traces of the file 
including, in high security systems multiple write-overs or disc reformatting. Step 25 8 stores a map. 
The map may be stored locally or remotely as described earlier. The system ends in step 260. All 
traces of these data elements or objects may be swiped clean or removed firom whatever computer 
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system generated the data objects or processed them, other than the memory storage locations. 
Deletion of data also includes the concept of deletion of data transmission paths, URLs, storage site 
locations and all temporary memory stores. Deletion of file location in the root directory of hard 
drive 168 of computer 140 is preferable in high security systems. 

[0099] FIG. 5 diagrammatically illustrates basic flowchart features for the reconstruction 

process. Step 302 accepts a request to reconstruct the secured data. Step 304 queries a local map 
and the security system or protocol. In a preferred embodiment the user would have to input several 
passwords, one of them being a local password on computer 140. A local map which may be 
accessed only through the password, may simply identify the URL of server 1 52. Decision step 306 
determines whether the local password is acceptable. If not, and error step is indicated in step 307, 
the attempt to log on to the security system is noted in step 309 (an audit trail), and the system either 
branches to repeat step 3 1 1 or bars the user from further activity in step 313. 
[0100] Returning to decision step 306, if the password is locally acceptable, the YES branch 

is taken and the system executes step 308 which releases a reconstruction request to the common 
storage facility I-com 154 or A-com 108 (FIGS. 2 and 1 A-B). The system in step 3 10 logs the user 
in, as well as time and date and the data regarding the request. In step 3 12, a download from the 
conomon data storage is provided to RAM 166 or hard drive 168. 

[0101] In step 314, a query is made to obtain the remote map from the remote security 

system. The decision step 316 indicates that the user again successfully inputs his or her security 
code. If not, error routine 3 1 7 is activated, the password failure is noted in step 3 1 9 (an audit trial), 
and the user is given an opportunity to repeat in step 321 or is barred or prohibited from further 
activity in step 323. If the user has correctly input the security code, the system in step 318 releases 
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the keys (to decrypt) and the map and releases the reconstruction request to the remote storage for 
the extracted data. This could be computer storage I-ext 156 or computer storage B-ext 110. In step 
320, the user's access to the extracted data is logged in along with the time and day and type of data 
request. In step 322, the system downloads the extracted data into RAM 1 66 and/or hard drive 1 68 
of computer 140. In step 3 24, an error routine is operated on the extracted data in order to insure that 
the extracted data properly matches the common or remainder previously stored. Decision step 326 
determines whether the error routine properly generates the correct count or output. If not, the 
system in step 327 indicates an error, in step 329 the system deletes the common files and the 
extracted files and the system in step 331 logs in the failed attempt. If the error checking routine on 
the extracted data is acceptable, the YES branch is taken from decision step 326 and the system, in 
step 328, proceeds to display the plaintext document or to integrate the plaintext docxmient pursuant 
to the security clearance initially input by the user. Step 330 ends this process. The end process may 
entail encrypting the data again and swiping clean all traces of data objects from the memory stores 
and computer handling units. Of course, every use of encryption requires decryption of the data prior 
to reconstruction. 

[0102] The system may incorporate various types of security systems or routines. 

pass word 
pass phrase 

multiple choice questions and answers 

initial, intermediate and subsequent security clearance routines 
biometric security routines (voice, fingerprint, signature, eye or retina 
scan) 



43 



[0103] The reconstruction routines may be interrupted or the security system automatically 

activated or initiated upon the occurrence of externally generated triggers or upon certain 
predetermined conditions or conditional events. Limited extraction, security clearance, release of 
data and reconstruction limits may be imposed. 

[0104] The display systems 220, 222 in FIG. 3 include CRT monitors, LCD screens, 

projection screens and combinations of those systems, 

[0 1 05] The audit trail to monitor reconstruct and reconstruction attempts may include adding 

a time/data stamp to the remainder data and/or the extracted data prior to storage and a cross-check 
to the audit trail log during the reconstruction process. 
[0106] Placeholders in the remainder document may be: 

blank spaces 

data symbols or elements " — " or "xxx" 
false data 

clearly erroneous data "ABC Company" or "Baker" 

chaff or hash marks 

messages 

bar code 

serialization data 

alerts 

links to other data objects 
null set indicators "[ ]" 
URL or website addresses 
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[0107] It is believed that the present invention is faster, during reconstruction, than standard 

encryption techniques, on the order of 100 to 1,000 faster. 

[0108] The system and method described herein may operate substantially automatically, that 

is, without operator intervention, other than the security clearance function. The clearance function 
does require some type of operator authentication prior to retrieval of the extracted and remainder 
data. 

[0109] The system and the method may operate automatically in that the plaintext or 

originating data could be identified by a party desiring security. The system could obtain that data 
from any data input device (hard drive memory, floppy drive memory, flash card memory, personal 
data assistant (PDA), or any other type of data input device), filter the data, separate the extracted 
text or the remainder text, encrypt (or not encrypt) the data, separately store the extract and 
remainder data (all automatically, that is, without operator intevention). Hence, it is not necessary 
that the system operate with significant operator or manual intervention. Of course, the system may 
also operate on a plaintext document or data object that is being created "in real time" by an operator 
and keyboard, mouse or other type of data input device. 

[0110] The automatic operation of the system and the method can be caused by a triggering 

event. This triggering event may be a security attack (generating a trigger to start the gathering of 
plaintext, filtering, extraction and storing) or may be any other type of trigger such as a building 
burglar alarm, door alarm, fire alarm, or virus detection algorithm trigger. The event may be a time 
of day, week or month. It may be n seconds after the user stops typing on a keyboard. It may be a 
timed back-up feature. 
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[0111] Multiple filters may be utilized in the system and in connection with the method. 

These multiple filters may be useful in the operation of the system with a plurality of security levels. 
Each filter could filter out different levels of security sensitive items and each bundle or group of 
security sensitive items (from each distinct filter) could be stored at different computer storage 
locations. Multiple filters, multiple security levels and multiple storage areas may also include 
multiple encryption routines and decryption routines. Encryption and decryption routines can be 
related to the level of security of a particular group of data. 

[0112] Multiple maps may also be provided for singular or multiple storage of extracted data 

and remainder data. These maps may or may not indicate the originating point of the data. Maps 

can be parsed such that an intruder, upon discovery of a single map or map portion, could not locate 

the storage locations of all piece of the extracted data and remainder data. 

[0113] The concept of partial reconstruction also includes the concept that a portion of the 

plaintext would be reconstructed and the unreconstructed portions of the plaintext could be encrypted 

or could show blanks or other symbolic indicators. See the placeholder table above. 

[0114] Partial reconstruction of the plaintext also includes a concept that the security 

sensitive items or materials may be subject to different types of encryption. Hence, a single plaintext 

document may have multiple levels of security and multiple levels of encryption wherein each 

encryption has a different level of security assigned to it. 

[0115] The claims appended hereto are meant to cover modifications and changes within the 

scope and spirit of the present invention. 
[0116] What is claimed is: 
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